demo commit pushed loader and model #1

Open
Emerald-Z wants to merge 2 commits into ECLAIR-Robotics:master from Emerald-Z:ocr-1

Conversation

@Emerald-Z

No description provided.

@Emerald-Z Emerald-Z closed this Mar 28, 2023
@Emerald-Z Emerald-Z reopened this Mar 28, 2023

Emerald-Z commented Apr 1, 2023

Description

  • How this works: the script takes in an image, reads and preprocesses one frame by isolating the box that contains the digits, and then isolates each digit. Each digit is fed into a feed-forward NN to predict its value, and the predictions are summed according to their place values (tens, ones, tenths).
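
The final place-value step could look roughly like the sketch below. The name `combine_digits` and the `decimals` parameter are hypothetical, not taken from this PR, and the sketch assumes the predictions arrive most significant digit first. It builds an integer and divides once, which gives the same result as a weighted sum while avoiding accumulated floating-point error.

```python
def combine_digits(digits, decimals=1):
    """Combine predicted digits (most significant first) into one number.

    `decimals` is how many of the trailing digits sit after the decimal
    point; 1 matches a tens/ones/tenths display. Hypothetical helper.
    """
    if not all(0 <= d <= 9 for d in digits):
        raise ValueError("each prediction must be a single digit 0-9")
    # Accumulate as an integer, then shift the decimal point once.
    value = 0
    for d in digits:
        value = value * 10 + d
    return value / 10 ** decimals


print(combine_digits([4, 2, 7]))  # 42.7
```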

Limitations

  • Bounding-box padding is sometimes inaccurate, cutting the box off before the decimal place
  • Possible improvements:
    • Most important:
      • Using a 4-point transform so that the numbers can be read when viewed from an angle
      • Improving the model, which is currently not very accurate
    • Making bounding-box detection faster: it currently loops over every pixel, which could be very slow
    • Replacing the feed-forward net with a CNN
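
One possible way to avoid the per-pixel loop is to let NumPy reduce along each axis. This is only a sketch: it assumes the frame has already been thresholded into a 2-D boolean mask where digit pixels are `True`, and `bounding_box` and `pad` are hypothetical names rather than code from this PR.

```python
import numpy as np


def bounding_box(mask, pad=0):
    """Return (top, bottom, left, right) around the True pixels of a 2-D mask.

    bottom and right are exclusive, so mask[top:bottom, left:right] is the crop.
    Returns None when the mask has no True pixels.
    """
    rows = np.flatnonzero(mask.any(axis=1))  # indices of rows with any digit pixel
    cols = np.flatnonzero(mask.any(axis=0))  # indices of columns with any digit pixel
    if rows.size == 0:
        return None
    height, width = mask.shape
    # Apply padding, clamped to the image borders.
    top = max(int(rows[0]) - pad, 0)
    bottom = min(int(rows[-1]) + 1 + pad, height)
    left = max(int(cols[0]) - pad, 0)
    right = min(int(cols[-1]) + 1 + pad, width)
    return top, bottom, left, right


mask = np.zeros((6, 8), dtype=bool)
mask[2:4, 3:6] = True
print(bounding_box(mask))         # (2, 4, 3, 6)
print(bounding_box(mask, pad=1))  # (1, 5, 2, 7)
```

Clamping the padded box to the image borders keeps the crop valid when the digits sit near an edge; whether that also addresses the cutoff before the decimal place would need checking against real frames.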

Images:
Screen Shot 2023-03-28 at 10 59 53 AM

Screen Shot 2023-03-28 at 11 00 11 AM

Screen Shot 2023-03-28 at 11 00 23 AM

Screen Shot 2023-03-28 at 11 00 33 AM

Test cases: WIP

  • I wrote a simple test case with unittest that checks whether the right numbers are detected. It is currently failing.
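
While the model itself is still inaccurate, one option is to test the surrounding logic separately by injecting a stub predictor. The sketch below is illustrative only: `read_value` and its `predict` argument are hypothetical stand-ins, not the functions in this PR.

```python
import unittest


def read_value(digit_images, predict, decimals=1):
    """Hypothetical pipeline tail: classify each digit crop, then combine."""
    value = 0
    for image in digit_images:
        value = value * 10 + predict(image)
    return value / 10 ** decimals


class TestReadValue(unittest.TestCase):
    def test_combines_digits_by_place_value(self):
        # Stub predictor: each fake "image" is simply its own label.
        self.assertAlmostEqual(read_value([4, 2, 7], lambda image: image), 42.7)

    def test_leading_zero(self):
        self.assertAlmostEqual(read_value([0, 5, 0], lambda image: image), 5.0)
```

Saved as a test module, this can be run with `python -m unittest`. A failure here would point at the combining logic rather than at the classifier.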
